Prediction of function divergence in protein families using the substitution rate variation parameter alpha.
نویسندگان
چکیده
Protein families typically embody a range of related functions and may thus be decomposed into subfamilies with, for example, distinct substrate specificities. Detection of functionally divergent subfamilies is possible by methods for recognizing branches of adaptive evolution in a gene tree. As the number of genome sequences is growing rapidly, it is highly desirable to automatically detect subfamily function divergence. To this end, we here introduce a method for large-scale prediction of function divergence within protein families. It is called the alpha shift measure (ASM) as it is based on detecting a shift in the shape parameter (alpha [alpha]) of the substitution rate gamma distribution. Four different methods for estimating alpha were investigated. We benchmarked the accuracy of ASM using function annotation from Enzyme Commission numbers within Pfam protein families divided into subfamilies by the automatic tree-based method BETE. In a test using 563 subfamily pairs in 162 families, ASM outperformed functional site-based methods using rate or conservation shifting (rate shift measure [RSM] and conservation shift measure [CSM]). The best results were obtained using the "GZ-Gamma" method for estimating alpha. By combining ASM with RSM and CSM using linear discriminant analysis, the prediction accuracy was further improved.
منابع مشابه
Substitutional Analysis of Orthologous Protein Families Using BLOCKS
Orthologous proteins, form due to divergence of parental sequence, perform similar function under different environmental and biological conditions. Amino acid changes at locus specific positions form hetero-pairs whose role in BLOCK evolution is yet to be understood. We involve eight protein BLOCKs of known divergence rate to gain insight into the role of hetero-pairs in evolution. Our procedu...
متن کاملAnalysis of NSP4 Gene and Its Association with Genotyping of Rotavirus Group A in Stool Samples
Background: Non-structural protein 4 (NSP4) is a critical protein for rotavirus (RV) replication and assembly. This protein has multiple domains and motifs that predispose its function and activity. NSP4 has a sequence divergence in human and animal RVs. Recently, 14 genotypes (E1-E14) of NSP4 have been identified, and E1 and E2 have been shown to be the most common genotypes in human. Methods:...
متن کاملExploring the Relationships between Mutation Rates, Life History, Genome Size, Environment, and Species Richness in Flowering Plants.
A new view is emerging of the interplay between mutation at the genomic level, substitution at the population level, and diversification at the lineage level. Many studies have suggested that rate of molecular evolution is linked to rate of diversification, but few have evaluated competing hypotheses. By analyzing sequences from 130 families of angiosperms, we show that variation in the synonym...
متن کاملGenomic Determinants of Protein Evolution and Polymorphism in Arabidopsis
Recent results from Drosophila suggest that positive selection has a substantial impact on genomic patterns of polymorphism and divergence. However, species with smaller population sizes and/or stronger population structure may not be expected to exhibit Drosophila-like patterns of sequence variation. We test this prediction and identify determinants of levels of polymorphism and rates of prote...
متن کاملExistence and uniqueness of weak solutions for a class of nonlinear divergence type diffusion equations
In this paper, we study the Neumann boundary value problem of a class of nonlinear divergence type diffusion equations. By a priori estimates, difference and variation techniques, we establish the existence and uniqueness of weak solutions of this problem.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Molecular biology and evolution
دوره 23 7 شماره
صفحات -
تاریخ انتشار 2006